Nearest Neighbour Classification with Background Knowledge Extended to Semi-supervised Learning

نویسندگان

  • JASON CHAN
  • IRENA KOPRINSKA
  • JOSIAH POON
  • Jason Chan
  • Irena Koprinska
  • Josiah Poon
چکیده

Semi supervised methods involve converting unlabelled data into high quality labelled data that can be used to improve the performance of conventional supervised methods that had previously been given a small training set. Unlabelled data has also been shown to be helpful in a supervised setting called ‘bridging’ where unlabelled data have been used to help relate labelled instances to those that are being classified. In this paper, we propose a new supervised bridging method that can be used to improve existing semisupervised methods in certain problem settings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Local nearest neighbour classification with applications to semi-supervised learning

We derive a new asymptotic expansion for the global excess risk of a local k-nearest neighbour classifier, where the choice of k may depend upon the test point. This expansion elucidates conditions under which the dominant contribution to the excess risk comes from the locus of points at which each class label is equally likely to occur, but we also show that if these conditions are not satisfi...

متن کامل

Enhancing the Performance of Semi-Supervised Classification Algorithms with Bridging

Traditional supervised classification algorithms require a large number of labelled examples to perform accurately. Semi-supervised classification algorithms attempt to overcome this major limitation by also using unlabelled examples. Unlabelled examples have also been used to improve nearest neighbour text classification in a method called bridging. In this paper, we propose the use of bridgin...

متن کامل

Scene Classification Via pLSA

Given a set of images of scenes containing multiple object categories (e.g. grass, roads, buildings) our objective is to discover these objects in each image in an unsupervised manner, and to use this object distribution to perform scene classification. We achieve this discovery using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature, here ap...

متن کامل

Semi-Supervised Self-Organizing Feature Map for Gene Classification

In this thesis, a study on gene expression data analysis is done using some supervised, unsupervised and semi-supervised approaches. The task of class prediction for six gene expression datasets (namely, Brain Tumor, Colon Cancer, Leukemia, Lymphoma and SRBCT) has been carried out. Here, a one-dimensional self-organizing feature maps (SOFM) in a semi-supervised learning framework is developed f...

متن کامل

Unsupervised Feature Learning via Non-Parametric Instance Discrimination

Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether this observation can be extended beyond the conventional domain of supervised learning: Can we learn a good feature representation that captures apparent similarity among instances, instead of classes, by merely asking ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005